AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Off-Policy Learning

# Off-Policy Learning

Tqc PandaPickAndPlace V1
This is a deep reinforcement learning model based on the TQC algorithm, specifically designed for the PandaPickAndPlace-v1 environment, used for robotic arm grasping and placing tasks.
Molecular Model
T
sb3
14
7
Sac Walker2d V3
This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.
Physics Model
S
sb3
43
0
Dqn MountainCar V0
This is a DQN agent model trained using stable-baselines3, specifically designed to solve reinforcement learning tasks in the MountainCar-v0 environment.
Molecular Model
D
sb3
578
1
Decision Transformer Gym Hopper Medium
This is a decision transformer model trained on medium-performance trajectories in the Gym Hopper environment, suitable for continuous control tasks.
Physics Model Transformers
D
edbeeching
6,518
6
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase